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Aim of this work: 



Putting forward a class of optimization methods 
largely overlooked in systems and control: 
the direct search methods. 



They can however be very adequate and powerful 
for many complex problems (possibly including yours). 
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Direct search optimization 



Direct search optimization) 

Direct search optimization methods 

• What are these methods? 

• Why are they overlooked in systems and control? 

• Why should they be used? 
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Direct search optimization) 

What are these methods? 



• The aim is to min(/max)imize an objective function 
f (x) : W — > M, possibly under constraints, 

starting from one (or several) feasible initial solution(s). 

• Only use function evaluations to decide how to explore R", 

• need no gradient or Hessian information. 

• In this talk, direct search = derivative-free optimization 

• See e.g. webpages of Luis Vicente and Charles Audet. 
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Direct search optimization! 

Why are they overlooked in systems and control?: A history 

• 1957-1961: The birth (Box, Davidon, Hooke, Jeeves) 

• 1961-1971: The golden age (1965 Nelder-Mead) 

• Very efficient in practice. 

• 1972-1989: The downfall (after survey of W. Swann 1972) 

• No proofs of convergence and can sometimes be slow. 

• 1990-...: The resurrection = Proofs of convergence 

• Torczon and Dennis: MDS, proof on smooth f(x): 1997 

• Audet and Dennis: MADS, proof on non-smooth f"(x): 2003 

• Vicente and Custodio: proof on discontinuous f(x): 2010 

Last item arrived too late for systems and control: 
From the early nineties, our field got saturated by convex 
optimization and LMIs, solvable very efficiently but... 
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Why should they be used? (1/2) 

• Current problems of interest have non-convex feasible sets. 

• The best that can be done is to propose methods with 
guaranteed convergence to locally optimal solutions. 

• There exist (new) direct search methods guaranteed to 
converge even on most non-smooth or discontinuous f(x). 
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Why should they be used? (2/2) 

When (clean) gradients of f(x) are not (easily) available. 

The main advantages are: 

• Recent strong convergence guarantees 

• Exponential performance of computers 

• Simulations are more routine and accurate 

• Ease of use and implementation 
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Optimization methods: A comparison 



A basic comparison of optimization methods: 



Class of methods 


Computational time 


Adequate problems 


Convex/LMI 


Very efficient 1 


Many specific 
(sub)problems 


Gradient-based 


Efficient 


Clean derivatives 
must be available 


Derivative-free 


May be slow 


'Any' f(x) : M" -> R 
(n < 25 - 100) 



"""May need to introduce a lot of additional variables to convexify 
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Problems in the paper (SOF) 
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The problems considered in the paper (1/2) 

Static Output Feedback optimization, m x p MIMO LTI systems: 
1) Stabilization of the closed-loop: 

Find K e R mxp s.t. a {A + BKC) < (continuous time) 

2 and 3) Min. the %2 or T-L^ norm of the performance channel: 

min K \\T WZ (K, s)\\ 2 or min K \\T WZ (K , s)^, s.t. a(A + BKC) < 

With unstable models from COMPlib : a library of actual and 
academic models, currently often used for benchmarking. 
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The problems considered in the paper (2/2) 



Non-convex: LM I s (and iterative LMI schemes rarely converge) 

The objectives admit however gradients or Clarke subgradients, 
which should then be used for optimization purposes. 

This is implemented in the methods HIFOO and hinf struct. 

Paper's benchmarks: comparison of the objectives values reached 
and computational times required, by a DS method and by HIFOO. 

Motivation: to verify convergence and assess performance of DS. 
Then other f(x) without gradients can be considered. 
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Note on the direct search method used 



Currently, the best two derivative-free methods are certainly MADS 
and SID-PSM, having strong theoretical convergence guarantees. 

In practice, a very efficient method easily accessible and usable is 
the Nelder-Mead algorithm restarted at the last solution until no 
improvement is obtained (to a given accuracy). 

This generates a set of dense (enough) exploring directions in the 
search space M." (but without formal theoretical guarantee). 

Packs a serious punch in practice, sufficing to make the point. 
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Summary of 13000 tests 



The direct search method converges as expected, and is reasonably 
fast (meant for off-line design), despite not using gradients: 

. 1) Stabilization success: DS 92.3%, HIFOO 90.6%. 
Ratio computational time DS on HIFOO = 0.74 

• 2) Performance channel H.2 norm minimization: 

Similar: 60% / Better for DS: 14% / Better for HIFOO: 26% 
Ratio computational time DS on HIFOO = 10 

• 3) Performance channel T-Loo norm minimization: 

Similar: 40% / Better for DS: 23% / Better for HIFOO: 37% 
Ratio computational time DS on HIFOO = 1.3 
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Other problem, without gradient 
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Time-response shaping problem 



Objective considered: to optimize explicitly the features of a time 
response z(x, t), with the controller parameters x G R". 

Interesting objective: f(x) = t r + A maxd ev , where: 

t r G M + is the rise time needed by the response z(x, t) to reach a 
desired settling region (above z m3X (t) and under z mm (t)), 

maxdev^ M + is the maximum deviation of the response z(x, t) 
outside of the desired region (for t > above, and t > t r under) 

and A € M + is the scalar trade-off parameter. 
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Difficulty and solution 



The objective max^ ev is non-smooth but locally Lipschitz: 

• Subgradients can be used for minimization to locally optimal 
solutions (see Bompart, Apkarian and Noll 2008). 

• (New) DS methods also converge on max^ ev (but slower). 

The objective ty is however more difficult: 

• It is actually discontinuous and no gradient analysis exists. 

• (New) DS have guarantees of convergence for such objectives! 
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Two results 



Time-response z(x, t) shaping examples: 

Optimization of PID parameters (x = [K p , Kj, Kj]), by min x f(x). 
Choice of time envelope: z mm {t) = 0.98 and z max (t) = 1.02 

Two animated figures present two optimizations. 
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Results animation (1/2) 

Initial solution: Ziegler-Nichols parameters. 





K = 4.059, K. = 0.4388, K = 9.386 : f(x) = 0.5505 

p _i d _ | 



40 60 
time t [s] 
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K < « > > » - *K + 
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(click on figure to launch animation) 
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Results animation (2/2) 

Initial solution: random parameters, not stabilizing. 
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Take-home messages 



• There definitely exists alternatives to LMIs. 

Which method is adequate for the considered problem? 

• Direct search methods now have strong theoretical 
convergence guarantees, and are acceptable in practice 
(efficient computers, methods and simulators). 

• Should be much more considered for many open 
problems of optimization in systems and control. 
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Other problem, without gradient 



Thank you for your attention! 



Questions? 



Emile.Simon@uclouvain.be 
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